NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A Collection of Pragmatic-Similarity Judgments over Spoken Dialog Utterances.

Ward, N; Marco, D (May 2024, ELRA and ICCL)
Calzolari, N; Kan, M; Hoste, V; Lenci, A; Sakti, S; Xue, N (Ed.)
Full Text Available
ChainNet: Structured Metaphor and Metonymy in WordNet

Maudslay, R; Teufel, S; Bond, F; Pustejovsky, J (May 2024, ELRA and ICCL)
Calzolari, N; Kan, M; Hoste, V; Lenci, A; Sakti, S; Xue, N (Ed.)
The senses of a word exhibit rich internal structure. In a typical lexicon, this structure is overlooked: A word`s senses are encoded as a list, without inter-sense relations. We present ChainNet, a lexical resource which for the first time explicitly identifies these structures, by expressing how senses in the Open English Wordnet are derived from one another. In ChainNet, every nominal sense of a word is either connected to another sense by metaphor or metonymy, or is disconnected (in the case of homonymy). Because WordNet senses are linked to resources which capture information about their meaning, ChainNet represents the first dataset of grounded metaphor and metonymy.
more » « less
Full Text Available
Building a Broad Infrastructure for Uniform Meaning Representations

Bonn, J; Buchholz, M; Chun, J; Cowell, A; Croft, W; Denk, L; Ge, S; Hajic, J; Lai, K; Martin, J; et al (May 2024, ELRA and ICCL)
Calzolari, N; Kan, M; Hoste, V; Lenci, A; Sakti, S; Xue, N (Ed.)
This paper reports the first release of the UMR (Uniform Meaning Representation) data set. UMR is a graph-based meaning representation formalism consisting of a sentence-level graph and a document-level graph. The sentence-level graph represents predicate-argument structures, named entities, word senses, aspectuality of events, as well as person and number information for entities. The document-level graph represents coreferential, temporal, and modal relations that go beyond sentence boundaries. UMR is designed to capture the commonalities and variations across languages and this is done through the use of a common set of abstract concepts, relations, and attributes as well as concrete concepts derived from words from invidual languages. This UMR release includes annotations for six languages (Arapaho, Chinese, English, Kukama, Navajo, Sanapana) that vary greatly in terms of their linguistic properties and resource availability. We also describe on-going efforts to enlarge this data set and extend it to other genres and modalities. We also briefly describe the available infrastructure (UMR annotation guidelines and tools) that others can use to create similar data sets.
more » « less
Full Text Available
Linear Cross-document Event Coreference Resolution with X-AMR

Ahmed, S; M; Baker, G; Judge, E; Reagan, M; Wright-Bettner, K; Palmer, M; Martin, J (May 2024, ELRA and ICCL)
Calzolari, N; Kan, M; Hoste, V; Lenci, A; Sakti, S; Xue, N (Ed.)
Event Coreference Resolution (ECR) as a pairwise mention classification task is expensive both for automated systems and manual annotations. The task`s quadratic difficulty is exacerbated when using Large Language Models (LLMs), making prompt engineering for ECR prohibitively costly. In this work, we propose a graphical representation of events, X-AMR, anchored around individual mentions using a cross-document version of Abstract Meaning Representation. We then linearize the ECR with a novel multi-hop coreference algorithm over the event graphs. The event graphs simplify ECR, making it a) LLM cost-effective, b) compositional and interpretable, and c) easily annotated. For a fair assessment, we first enrich an existing ECR benchmark dataset with these event graphs using an annotator-friendly tool we introduce. Then, we employ GPT-4, the newest LLM by OpenAI, for these annotations. Finally, using the ECR algorithm, we assess GPT-4 against humans and analyze its limitations. Through this research, we aim to advance the state-of-the-art for efficient ECR and shed light on the potential shortcomings of current LLMs at this task. Code and annotations: https://github.com/ahmeshaf/gpt_coref
more » « less
Full Text Available
Encoding Gesture in Multimodal Dialogue: Creating a Corpus of Multimodal AMR

Lai, K; Brutti, R; Donatelli, L; Pustejovsky, J (May 2024, ELRA and ICCL)
Calzolari, N; Kan, M; Hoste, V; Lenci, A; Sakti, A; Xue, N (Ed.)
Full Text Available

Search for: All records